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ABSTRACT 

Nowadays online shopping is emerging as a growth of business. Customers are getting used to purchasing the items online. 
Online reviews are an essential resource for users choosing to buy a product, watch a movie or go to a hotel. When it needs to 
decide the items/products through online, the opinions of other users through review matter a lot. It gives a good idea of the 
product to be purchasable or not. However, people face the information overloading problem. So the problem is as to how to 
get valuable information from user reviews so as to understand a user’s preference and make an accurate recommendation. 
Recommender systems become risen as an essential tool to overwhelm the negative result of information overloading problem. 
The traditional recommendation system examines some factors like the user’s buying records, product classification, and user's 
geographic location. This paper is an attempt to discuss the three social factors with some rating prediction algorithms based 
on user sentiment similarity, item reputation and user circle influence and review the applicable sentiment dictionary to the 
recommender system. 
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1. INTRODUCTION 

Today online textual reviews play very important role in the 
decision process. Most of the websites take reviews of the 
user so as to develop their business. For example, it will be 
easy for the user to purchase the products online as another 
user gives much more information about the product 
through reviews.When it comes to online shopping people 
pay much more attention to other user reviews, especially in 
user's circle friends.User's reviews will help in rating 
prediction as high-quality products will get attached with 
good user reviews. Hence, as to how extract useful reviews 
of the users' and the relationship among the reviewers in the 
social networking platform has broadly discussed in web 
mining. 

Though, the user's rating is not always available on many 
websites. As reviews contain quite detail about the product 
information and the user opinion information, which have a 
great impact on a user’s decision. There is a possibility that 
not every user prefers to rate every item on a website. 
Therefore, there are many unrated items still left in a user- 
item matrix. It is discussed in several rating prediction 
methods, e.g. [1], [4], As we know user reviews are always 
available on the website. In such situation, it’s useful and 
essential to parse the user's reviews to help to predict the 
unrated product/items for the user. The growth of review 
websites gives a view of mining the user choices and 
predicting user’s ratings. Sentiment analysis is mostly used 
for extracting the user's interest. In general, the sentiment is 
practiced to describe user's own opinions about the items. 
Generally, reviews are categorized into three groups, 
positive, negative and neutral. However, it is quite difficult 
for the user's to take a decision when all candidate items 
show positive sentiment, negative sentiment or neutral 
sentiment. Customers not only pay attention towards better 
product but want to know how extremely good the product 
is. It is known that different users may have different 
opinions expression priority. For eg., users may use word 
"fine" to define a “splendid” product, while others may prefer 
to use "fine" to define a "wonderful" product [11].. That is 


users are more concerned about item’s reputation. For the 
product/item reputation, sentiment of the reviews is 
required. Normally, if the item's reviews show positive 
sentiment, then the item considered as a good reputation, 
but if the item's reviews show a negative opinion then the 
item considered to be a bad reputation. If we understand 
user sentiment, we can easily know the item reputation and 
even the user ratings. When we explore the internet for 
online purchasing, both positive and negative reviews are 
important to be a preference. It is observed that one user's 
review will influence the other users and, Users review gives 
more information about the product. Despite, it's hard to 
guess the user's textual sentiment .So there is much need to 
pay attention to the interpersonal influence so as to extract 
the user preferences .Most of the methods of the 
interpersonal social influence in social networking platform 
have shown good performance in the recommendation, 
which can appropriately solve the "cold start" problems and 
"sparsity problem". Despite, the existing approaches [2], [3], 
[5], [6], [9] mainly focus on product class information or tag 
information to study the interpersonal social influence of the 
user. These methods only use for structured data, which is 
not always available on every website. Though user reviews 
may be useful in mining the interpersonal social inference 
and user choices. 
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2. OBJECTIVE 

> To improve the accuracy rating prediction. 

> To perform a sentiment analysis on textual reviews. 

> To overcome the sparsity problem. 

> To implement a recommender system. 

3. RELATED WORK 

In this paperwork, we survey some methods of collaborative 
filtering technique, the rating prediction approaches of 
matrix factorization (MFJ, review based approaches, and 
sentiment analysis. 

3.1 Collaborative Filtering 

The collaborative filtering scheme is used to predict the user 
preferences for the unrated items and after that, it 
recommends the most preferred items out of the list to the 
users. Nowadays it is mostly used recommender system 
technique. It provides the best preferences to the user. Many 
website using this technique like Amazon, twitter,etc.. As we 
are aware that 33% of sales of Amazon is just because of 
recommender system which uses the collaborative 
technique. Algorithms [18], [12], [24], [26], [35] have 
already been devised so as to get better the quality of the 
recommender system. An algorithm for Collaborative 
approach is CF method which is an old algorithm. The basic 
thought is that users prefer to get those products which they 
used to purchase after taking a look on their history 
preferences. Sutter [9] propose a method that enables tags to 
get fused into CF algorithms and combine the three- 
dimensional relationships between items, users, and tags. 
Moreover, in method [12] it provides a user's rating to an 
item by determining the average ratings of related or 
correlated items by the same user. Its performance enhances 
when determining the similarity between items. S.Gao [28] 
proposed a collaborative filtering recommendation scheme 
based on topic relationship, they assume that experts with 
related topics would possess similar feature vectors. 

3.2 Matrix Factorization based Methods 

3.2.1 Basic Matrix Factorization 

The matrix factorization method used for low-dimensional 
matrix decomposition. It is the product of marices of a 
matrix.. There are various distinct matrix decompositions, all 
find use amongst a selective class of queries. These methods 
have shown to be useful for predicting the user decisions 
from observed user rating matrix. A matrix is presumed by 
decomposing the user reviews what users assigned to the 
product. Matrix factorization approaches are recommended 
for social recommender system due to their capability to 
handle the large datasets. For collaborative scheme there are 
many matrix factorization schemes have been devised. From 
the Basic matrix factorization[1], a potential eigenvector 
matrix is used for both the Recommendation users and 
items, and it calculated all the rating value. 

3.2.2 Social Recommendation 

Some matrix factorization that are based on social 
recommendations is meant to resolve the "cold start" 
problems. In today life, people’s decision is often influenced 
by the friends’ action or recommendation. People tend to 
influence when it comes to buying the items. How to get 
Social information is broadly analyzed. Yang [2] Propose the 
"Trust Circles" in a social networking platform based on 
matrix factorization. Jiang [3] introduce another important 
factor, the personal preference. Some sites always not offer 
the structured information. These approaches are only 


suitable for structured information, but not for the 
unstructured data(i.e. textual data]. Hence, social 
information of each user is not available and it is quite 
complex to offer a reliable prediction for each user. To solve 
the issue, the sentimental factor is used to enhance social 
recommendation. 

3.3 Applications based on Reviews 

There are several recommendation tasks has been done on 
the basis of user reviews or comments. Most websites using 
user's reviews for the growth of their business. Qu [13] 
propose a method which predicts a user’s numerical rating 
in an item review, and they use a constraint regression 
technique for determining the scores of sentimental 
opinions. Jang [10] propose a rating review prediction 
system by taking the social connections of a user or 
reviewer. It is used to analyze the social relation of 
user's/reviewers into strong and ordinary connection. Zhang 
[16] include various item review factors such as product 
quality, content, time of the review, durability of the item 
and positive reviews of the customer. They perform a 
product ranking model that applies weights to product/item 
review factors so as to calculate the ranking score. Ling et al. 
[2 3] proposes a model that fuses content-based collaborative 
filtering and examining the data of the ratings as well as the 
reviews. Luo [17] identify and resolve a new problem: aspect 
recognition and rating collectively with total rating 
prediction for unrated reviews. They propose an LDA-style 
topic design which makes ratable aspects of sentiment and 
associated modifiers with rating. 

3.4 Sentiment-based Applications 

It is used to check whether the text is negative, positive or 
neutral. This is also called as opinion mining, concluding the 
opinion or emotion of an announcer. Sentiment analysis 
opinion is basically directed on review-level, sentence-level, 
and phrase-level. Review-level [20], [21] and sentence-level 
analysis [22] tried to analyze the entire review as a positive, 
neutral and negative. Phrase-level [26], [24] try to extract 
the feature of the item rooted on the feature likings of the 
user.The main idea of using phrase-level sentiment analysis 
is the development of sentimentlexicon. Pang [20] propose a 
contextual-insensitive evaluative lexical approach. Though, it 
can't deal with the mismatch between the base valence of the 
term and the author’s usage. Polanyi [18] explain that base 
valence of a lexical product is transformed by lexical and 
context.it also proposed a method for some contextual 
shifters. They measure user sentiment opinion based on a 
finer-grained approach. Taboada [19] exhibit a semantic 
familiarization calculator which uses the dictionaries of 
words interpreted with their semantic familiarization 
[polarity and strength] and incorporates intensification and 
converse.There are several schemes to sentiment analysis 
used to solve personalized suggestion [8], [25], [26], [27], 
Zhang [8] propose a self-managed and lexicon-based 
sentiment distribution approach to find out the sentiment of 
a review that comprises both text words and emotions. They 
use the sentiment for the recommendation of the product. 
With analyzing ratings of a user, they can infer specific 
experts to a destination user based on the people population. 
The data held in the user-service cooperations can help in 
predict the friendship propagations. Then the data is 
collected from user-item interaction and user-user 
connection. Lee [25] introduce a recommendation system 
utilizing the idea of Experts to determine both novel and 
relevant recommendations. From parsing the ratings, of the 
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user it can suggest special experts to a target user based on 
the people population. Lei [27] work on phrase-level opinion 
analysis to conclude a particular item’s reputation. They also 
propose the concept of "virtual friends" to model items’ 
relations, which can reduce the time complexity during 
training. Zhang [26] recommend an EFM model to make an 
understandable recommendation, they derive explicit item 
features and user choices by phrase-level sentimental 
analysis of user reviews. 

4. LDA APPROACH 

4.1 Product Features Extraction 

Product features mean discussing theissuesofa product. In 
this paperwork, we review the extracted item features of 
textual reviews utilizing LDA [7]. LDA is a generative topic 
pattern extractor. This technique applied to statistics, 
pattern recognizing and machine learning so as to get a 
linear sequence of characteristics that defines or classifies 
two or more classes of objects or events. The features are 
named entities, product attributes etc. As LDA is an 
analytical model which is used to exhibit the relationship 
between topics, reviews, and words. LDA describes 
documents as mixtures of topics eject out words with several 
probabilities. It works as following steps when writing the 
documents: 

I. Decide the number of words forthe N document (using 
Poisson distribution], 

II. Pick a topic mix for the record/document with the help 
of the Dirichlet distribution fixed set of L topics. 

III. Produce each word w in the document by: 

A. Selecting a topic (by using multinomial distribution) 

B. Produce the word utilizing the topic (topic’s 
multinomial distribution). 

4.2 Data Pre-processing 

By filtering process, words are gathered from the user's 
reviews positive words, neutral words, negative words and 
the sentiments degree of words. It also filters out "Stop 
Words" [14,15] like a pronoun, article, etc. All the different 
words of the reviews are included in vocabulary. 

4.3 Generating The Process 

It studies all the user's document as D and the number of 
topics expressed as m. The output will be represented as 
topic preference distribution of each user and topic list 
consists of 10 features words. 


Tablel: Sentiment Dictionary 


Levels 

Product Features 

Level 1 

Best, Awesome, Superb, Amazing 

Level 2 

Very, Better, Fine, Slightly 

Level 3 

Comparative, Further, Rather, Few 

Level 4 

Some, Just, A little bit, Quite 

Level 5 

Less, Little, Insufficient, Not very 


5. RATING PREDICTION ALGORITHMS 

There are many algorithms has been offered for rating 
prediction.In this review paper, we study some algorithms of 
rating prediction. 

A. Basic MF: It is baseline matrix factorization method 
approach [1], This method doesn't contain any the social 
factors. 

B. Circle Con: It is proposed in [8], this method uses the 
interpersonal trust factor in the social networking 
platform.By using matrix factorization it suggests the 


trust circles. 

C. Context MF: This method used the social factors; 
interpersonal influence and individual preferences.This 
method [3] enhances the performance of traditional 
item-based collaborative filtering technique [12], [24], 

D. PRM: This approach proposed in [5]. It gets to consider 
the three social factors, i.e. interpersonal interest 
similarity, interpersonal influence, and personal 
interest. It also used the matrix factorization for 
prediction of the user's ratings. 

E. EFM: This approach [26],builds two specific matrixes: 
the first one is user-feature concentration matrix and 
another one is an item-feature quality matrix. User- 
feature matrix measure the product feature of the item 
which the user cares.The item-feature matrix contains 
the quality of an item of the product. 
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7. CONCLUSION 

Recommender system has highly influenced the 
business.Every website getting use of recommender system 
so as to enhance their business.As more and more data are 
used in the form of reviews so this is the best idea to utilize 
that Reviews data.Texual reviews easily help to identify with 
what the user wants to say about a product.Collaborative 
filtering gives the information about the user preference to 
the other user.In other words,it recommends the item to the 
user. In this review paper, three social factors have been 
discussed which can be used in recommender systems.For 
product feature, sentiment dictionary is needed for storing 
the information. So in our future work we are going to make 
a rating prediction recommender system by using three 
social factors i.e user sentiment similarity, user circle 
influence, and item reputation.We will modify the sentiment 
dictionaries by applying the fine-grained sentiment analysis 
,so based on these it will predict the users' rating. 
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